Overview
Brought to you by YData
Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 33600 |
| Missing cells | 91479 |
| Missing cells (%) | 11.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 5.9 MiB |
| Average record size in memory | 184.0 B |
Variable types
| Text | 13 |
|---|---|
| Numeric | 8 |
| Categorical | 2 |
wins has constant value "0" | Constant |
budget is highly overall correlated with grossWorldWide and 2 other fields | High correlation |
grossWorldWide is highly overall correlated with budget and 2 other fields | High correlation |
gross_US_Canada is highly overall correlated with budget and 2 other fields | High correlation |
opening_weekend_Gross is highly overall correlated with budget and 2 other fields | High correlation |
MPA has 7976 (23.7%) missing values | Missing |
budget has 21785 (64.8%) missing values | Missing |
grossWorldWide has 15378 (45.8%) missing values | Missing |
gross_US_Canada has 16029 (47.7%) missing values | Missing |
opening_weekend_Gross has 18077 (53.8%) missing values | Missing |
directors has 359 (1.1%) missing values | Missing |
writers has 1576 (4.7%) missing values | Missing |
stars has 473 (1.4%) missing values | Missing |
genres has 382 (1.1%) missing values | Missing |
countries_origin has 366 (1.1%) missing values | Missing |
filming_locations has 6729 (20.0%) missing values | Missing |
production_companies has 1378 (4.1%) missing values | Missing |
Languages has 474 (1.4%) missing values | Missing |
budget is highly skewed (γ1 = 97.63614372) | Skewed |
id has unique values | Unique |
Movie Link has unique values | Unique |
nominations has 23453 (69.8%) zeros | Zeros |
oscars has 31503 (93.8%) zeros | Zeros |
Reproduction
| Analysis started | 2025-01-15 14:55:50.471094 |
|---|---|
| Analysis finished | 2025-01-15 14:56:15.704285 |
| Duration | 25.23 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
id
Text
Unique 
| Distinct | 33600 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 262.6 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9.0647917 |
| Min length | 9 |
Unique
| Unique | 33600 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | tt0073195 |
|---|---|
| 2nd row | tt0073629 |
| 3rd row | tt0073486 |
| 4th row | tt0072890 |
| 5th row | tt0073692 |
| Value | Count | Frequency (%) |
| tt0073195 | 1 | < 0.1% |
| tt0072976 | 1 | < 0.1% |
| tt0072890 | 1 | < 0.1% |
| tt0073692 | 1 | < 0.1% |
| tt0072081 | 1 | < 0.1% |
| tt0073026 | 1 | < 0.1% |
| tt0072653 | 1 | < 0.1% |
| tt0073812 | 1 | < 0.1% |
| tt0073802 | 1 | < 0.1% |
| tt0073317 | 1 | < 0.1% |
| Other values (33590) | 33590 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 67200 | |
| 0 | 56288 | |
| 1 | 27140 | |
| 2 | 21158 | 6.9% |
| 6 | 20539 | 6.7% |
| 8 | 19768 | 6.5% |
| 7 | 18918 | 6.2% |
| 4 | 18615 | 6.1% |
| 5 | 18557 | 6.1% |
| 3 | 18415 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 237377 | |
| Lowercase Letter | 67200 | 22.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 56288 | |
| 1 | 27140 | |
| 2 | 21158 | 8.9% |
| 6 | 20539 | 8.7% |
| 8 | 19768 | 8.3% |
| 7 | 18918 | 8.0% |
| 4 | 18615 | 7.8% |
| 5 | 18557 | 7.8% |
| 3 | 18415 | 7.8% |
| 9 | 17979 | 7.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 67200 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 237377 | |
| Latin | 67200 | 22.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 56288 | |
| 1 | 27140 | |
| 2 | 21158 | 8.9% |
| 6 | 20539 | 8.7% |
| 8 | 19768 | 8.3% |
| 7 | 18918 | 8.0% |
| 4 | 18615 | 7.8% |
| 5 | 18557 | 7.8% |
| 3 | 18415 | 7.8% |
| 9 | 17979 | 7.6% |
Latin
| Value | Count | Frequency (%) |
| t | 67200 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 304577 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 67200 | |
| 0 | 56288 | |
| 1 | 27140 | |
| 2 | 21158 | 6.9% |
| 6 | 20539 | 6.7% |
| 8 | 19768 | 6.5% |
| 7 | 18918 | 6.2% |
| 4 | 18615 | 6.1% |
| 5 | 18557 | 6.1% |
| 3 | 18415 | 6.0% |
Title
Text
| Distinct | 31935 |
|---|---|
| Distinct (%) | 95.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 262.6 KiB |
Length
| Max length | 165 |
|---|---|
| Median length | 74 |
| Mean length | 16.206399 |
| Min length | 1 |
Unique
| Unique | 30564 ? |
|---|---|
| Unique (%) | 91.0% |
Sample
| 1st row | Jaws |
|---|---|
| 2nd row | The Rocky Horror Picture Show |
| 3rd row | One Flew Over the Cuckoo's Nest |
| 4th row | Dog Day Afternoon |
| 5th row | Shampoo |
| Value | Count | Frequency (%) |
| the | 10109 | 10.3% |
| of | 3405 | 3.5% |
| a | 1540 | 1.6% |
| and | 1122 | 1.1% |
| in | 1112 | 1.1% |
| to | 689 | 0.7% |
| love | 530 | 0.5% |
| 504 | 0.5% | |
| for | 405 | 0.4% |
| man | 402 | 0.4% |
| Other values (21418) | 78082 |
Most occurring characters
| Value | Count | Frequency (%) |
| 64300 | 11.8% | |
| e | 54733 | 10.1% |
| a | 37115 | 6.8% |
| o | 32784 | 6.0% |
| n | 29402 | 5.4% |
| i | 28755 | 5.3% |
| r | 28619 | 5.3% |
| t | 26109 | 4.8% |
| s | 21255 | 3.9% |
| h | 20333 | 3.7% |
| Other values (122) | 201130 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 387254 | |
| Uppercase Letter | 82617 | 15.2% |
| Space Separator | 64300 | 11.8% |
| Other Punctuation | 7034 | 1.3% |
| Decimal Number | 2513 | 0.5% |
| Dash Punctuation | 662 | 0.1% |
| Open Punctuation | 56 | < 0.1% |
| Close Punctuation | 56 | < 0.1% |
| Math Symbol | 16 | < 0.1% |
| Other Number | 12 | < 0.1% |
| Other values (4) | 15 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 54733 | |
| a | 37115 | |
| o | 32784 | 8.5% |
| n | 29402 | 7.6% |
| i | 28755 | 7.4% |
| r | 28619 | 7.4% |
| t | 26109 | 6.7% |
| s | 21255 | 5.5% |
| h | 20333 | 5.3% |
| l | 19029 | 4.9% |
| Other values (44) | 89120 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 11074 | 13.4% |
| S | 7427 | 9.0% |
| M | 5600 | 6.8% |
| B | 5472 | 6.6% |
| C | 4906 | 5.9% |
| A | 4562 | 5.5% |
| D | 4510 | 5.5% |
| L | 4218 | 5.1% |
| H | 3633 | 4.4% |
| W | 3508 | 4.2% |
| Other values (27) | 27707 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2009 | |
| ' | 1761 | |
| . | 1314 | |
| , | 845 | |
| ! | 482 | 6.9% |
| & | 337 | 4.8% |
| ? | 175 | 2.5% |
| / | 56 | 0.8% |
| * | 26 | 0.4% |
| ¡ | 8 | 0.1% |
| Other values (6) | 21 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 587 | |
| 1 | 416 | |
| 0 | 397 | |
| 3 | 311 | |
| 9 | 166 | 6.6% |
| 4 | 159 | 6.3% |
| 7 | 148 | 5.9% |
| 5 | 133 | 5.3% |
| 8 | 99 | 3.9% |
| 6 | 97 | 3.9% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 8 | |
| ³ | 2 | 16.7% |
| ² | 2 | 16.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 51 | |
| [ | 5 | 8.9% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 51 | |
| ] | 5 | 8.9% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 15 | |
| = | 1 | 6.2% |
Space Separator
| Value | Count | Frequency (%) |
| 64300 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 662 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 10 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3 |
Other Letter
| Value | Count | Frequency (%) |
| ª | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 469872 | |
| Common | 74663 | 13.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 54733 | 11.6% |
| a | 37115 | 7.9% |
| o | 32784 | 7.0% |
| n | 29402 | 6.3% |
| i | 28755 | 6.1% |
| r | 28619 | 6.1% |
| t | 26109 | 5.6% |
| s | 21255 | 4.5% |
| h | 20333 | 4.3% |
| l | 19029 | 4.0% |
| Other values (82) | 171738 |
Common
| Value | Count | Frequency (%) |
| 64300 | ||
| : | 2009 | 2.7% |
| ' | 1761 | 2.4% |
| . | 1314 | 1.8% |
| , | 845 | 1.1% |
| - | 662 | 0.9% |
| 2 | 587 | 0.8% |
| ! | 482 | 0.6% |
| 1 | 416 | 0.6% |
| 0 | 397 | 0.5% |
| Other values (30) | 1890 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 543613 | |
| None | 922 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 64300 | 11.8% | |
| e | 54733 | 10.1% |
| a | 37115 | 6.8% |
| o | 32784 | 6.0% |
| n | 29402 | 5.4% |
| i | 28755 | 5.3% |
| r | 28619 | 5.3% |
| t | 26109 | 4.8% |
| s | 21255 | 3.9% |
| h | 20333 | 3.7% |
| Other values (75) | 200208 |
None
| Value | Count | Frequency (%) |
| é | 161 | |
| ä | 69 | 7.5% |
| ü | 60 | 6.5% |
| ö | 60 | 6.5% |
| á | 59 | 6.4% |
| í | 56 | 6.1% |
| è | 50 | 5.4% |
| å | 45 | 4.9% |
| ô | 43 | 4.7% |
| ó | 43 | 4.7% |
| Other values (37) | 276 |
Movie Link
Text
Unique 
| Distinct | 33600 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 262.6 KiB |
Length
| Max length | 37 |
|---|---|
| Median length | 36 |
| Mean length | 36.064792 |
| Min length | 36 |
Unique
| Unique | 33600 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | https://www.imdb.com/title/tt0073195 |
|---|---|
| 2nd row | https://www.imdb.com/title/tt0073629 |
| 3rd row | https://www.imdb.com/title/tt0073486 |
| 4th row | https://www.imdb.com/title/tt0072890 |
| 5th row | https://www.imdb.com/title/tt0073692 |
| Value | Count | Frequency (%) |
| https://www.imdb.com/title/tt0073195 | 1 | < 0.1% |
| https://www.imdb.com/title/tt0072976 | 1 | < 0.1% |
| https://www.imdb.com/title/tt0072890 | 1 | < 0.1% |
| https://www.imdb.com/title/tt0073692 | 1 | < 0.1% |
| https://www.imdb.com/title/tt0072081 | 1 | < 0.1% |
| https://www.imdb.com/title/tt0073026 | 1 | < 0.1% |
| https://www.imdb.com/title/tt0072653 | 1 | < 0.1% |
| https://www.imdb.com/title/tt0073812 | 1 | < 0.1% |
| https://www.imdb.com/title/tt0073802 | 1 | < 0.1% |
| https://www.imdb.com/title/tt0073317 | 1 | < 0.1% |
| Other values (33590) | 33590 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 201600 | |
| / | 134400 | 11.1% |
| w | 100800 | 8.3% |
| . | 67200 | 5.5% |
| i | 67200 | 5.5% |
| m | 67200 | 5.5% |
| 0 | 56288 | 4.6% |
| h | 33600 | 2.8% |
| c | 33600 | 2.8% |
| e | 33600 | 2.8% |
| Other values (16) | 416289 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 739200 | |
| Decimal Number | 237377 | 19.6% |
| Other Punctuation | 235200 | 19.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 201600 | |
| w | 100800 | |
| i | 67200 | 9.1% |
| m | 67200 | 9.1% |
| h | 33600 | 4.5% |
| c | 33600 | 4.5% |
| e | 33600 | 4.5% |
| l | 33600 | 4.5% |
| o | 33600 | 4.5% |
| b | 33600 | 4.5% |
| Other values (3) | 100800 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 56288 | |
| 1 | 27140 | |
| 2 | 21158 | 8.9% |
| 6 | 20539 | 8.7% |
| 8 | 19768 | 8.3% |
| 7 | 18918 | 8.0% |
| 4 | 18615 | 7.8% |
| 5 | 18557 | 7.8% |
| 3 | 18415 | 7.8% |
| 9 | 17979 | 7.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 134400 | |
| . | 67200 | |
| : | 33600 | 14.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 739200 | |
| Common | 472577 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 201600 | |
| w | 100800 | |
| i | 67200 | 9.1% |
| m | 67200 | 9.1% |
| h | 33600 | 4.5% |
| c | 33600 | 4.5% |
| e | 33600 | 4.5% |
| l | 33600 | 4.5% |
| o | 33600 | 4.5% |
| b | 33600 | 4.5% |
| Other values (3) | 100800 |
Common
| Value | Count | Frequency (%) |
| / | 134400 | |
| . | 67200 | |
| 0 | 56288 | |
| : | 33600 | 7.1% |
| 1 | 27140 | 5.7% |
| 2 | 21158 | 4.5% |
| 6 | 20539 | 4.3% |
| 8 | 19768 | 4.2% |
| 7 | 18918 | 4.0% |
| 4 | 18615 | 3.9% |
| Other values (3) | 54951 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1211777 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 201600 | |
| / | 134400 | 11.1% |
| w | 100800 | 8.3% |
| . | 67200 | 5.5% |
| i | 67200 | 5.5% |
| m | 67200 | 5.5% |
| 0 | 56288 | 4.6% |
| h | 33600 | 2.8% |
| c | 33600 | 2.8% |
| e | 33600 | 2.8% |
| Other values (16) | 416289 |
Year
Real number (ℝ)
| Distinct | 65 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1992.3936 |
| Minimum | 1960 |
|---|---|
| Maximum | 2024 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 262.6 KiB |
Quantile statistics
| Minimum | 1960 |
|---|---|
| 5-th percentile | 1963 |
| Q1 | 1976 |
| median | 1993 |
| Q3 | 2009 |
| 95-th percentile | 2022 |
| Maximum | 2024 |
| Range | 64 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 18.957395 |
|---|---|
| Coefficient of variation (CV) | 0.0095148845 |
| Kurtosis | -1.2140287 |
| Mean | 1992.3936 |
| Median Absolute Deviation (MAD) | 16 |
| Skewness | -0.02639572 |
| Sum | 66944426 |
| Variance | 359.38283 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2022 | 600 | 1.8% |
| 2008 | 600 | 1.8% |
| 2020 | 600 | 1.8% |
| 2024 | 600 | 1.8% |
| 2004 | 599 | 1.8% |
| 1960 | 598 | 1.8% |
| 2013 | 551 | 1.6% |
| 2019 | 550 | 1.6% |
| 1987 | 550 | 1.6% |
| 1965 | 550 | 1.6% |
| Other values (55) | 27802 |
| Value | Count | Frequency (%) |
| 1960 | 598 | |
| 1961 | 501 | |
| 1962 | 501 | |
| 1963 | 500 | |
| 1964 | 500 | |
| 1965 | 550 | |
| 1966 | 499 | |
| 1967 | 500 | |
| 1968 | 500 | |
| 1969 | 500 |
| Value | Count | Frequency (%) |
| 2024 | 600 | |
| 2023 | 550 | |
| 2022 | 600 | |
| 2021 | 500 | |
| 2020 | 600 | |
| 2019 | 550 | |
| 2018 | 500 | |
| 2017 | 550 | |
| 2016 | 500 | |
| 2015 | 500 |
Duration
Text
| Distinct | 230 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 221 |
| Missing (%) | 0.7% |
| Memory size | 262.6 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 5.8641361 |
| Min length | 1 |
Unique
| Unique | 52 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 2h 4m |
|---|---|
| 2nd row | 1h 40m |
| 3rd row | 2h 13m |
| 4th row | 2h 5m |
| 5th row | 1h 50m |
| Value | Count | Frequency (%) |
| 1h | 27917 | |
| 2h | 4981 | 7.5% |
| 30m | 1728 | 2.6% |
| 35m | 1244 | 1.9% |
| 40m | 1122 | 1.7% |
| 33m | 1074 | 1.6% |
| 32m | 1038 | 1.6% |
| 38m | 978 | 1.5% |
| 31m | 962 | 1.5% |
| 36m | 959 | 1.5% |
| Other values (67) | 24091 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 33363 | |
| h | 33189 | |
| m | 32898 | |
| 32715 | ||
| 2 | 14674 | |
| 3 | 13905 | |
| 4 | 10170 | 5.2% |
| 5 | 8155 | 4.2% |
| 0 | 4452 | 2.3% |
| 8 | 3359 | 1.7% |
| Other values (9) | 8859 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 96918 | |
| Lowercase Letter | 66087 | |
| Space Separator | 32715 | 16.7% |
| Uppercase Letter | 14 | < 0.1% |
| Dash Punctuation | 5 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 33363 | |
| 2 | 14674 | |
| 3 | 13905 | |
| 4 | 10170 | 10.5% |
| 5 | 8155 | 8.4% |
| 0 | 4452 | 4.6% |
| 8 | 3359 | 3.5% |
| 7 | 3090 | 3.2% |
| 6 | 2983 | 3.1% |
| 9 | 2767 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 5 | |
| P | 4 | |
| T | 2 | 14.3% |
| V | 2 | 14.3% |
| X | 1 | 7.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| h | 33189 | |
| m | 32898 |
Space Separator
| Value | Count | Frequency (%) |
| 32715 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 129638 | |
| Latin | 66101 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 33363 | |
| 32715 | ||
| 2 | 14674 | |
| 3 | 13905 | |
| 4 | 10170 | 7.8% |
| 5 | 8155 | 6.3% |
| 0 | 4452 | 3.4% |
| 8 | 3359 | 2.6% |
| 7 | 3090 | 2.4% |
| 6 | 2983 | 2.3% |
| Other values (2) | 2772 | 2.1% |
Latin
| Value | Count | Frequency (%) |
| h | 33189 | |
| m | 32898 | |
| G | 5 | < 0.1% |
| P | 4 | < 0.1% |
| T | 2 | < 0.1% |
| V | 2 | < 0.1% |
| X | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 195739 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 33363 | |
| h | 33189 | |
| m | 32898 | |
| 32715 | ||
| 2 | 14674 | |
| 3 | 13905 | |
| 4 | 10170 | 5.2% |
| 5 | 8155 | 4.2% |
| 0 | 4452 | 2.3% |
| 8 | 3359 | 1.7% |
| Other values (9) | 8859 | 4.5% |
MPA
Categorical
Missing 
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7976 |
| Missing (%) | 23.7% |
| Memory size | 262.6 KiB |
| R | |
|---|---|
| Not Rated | |
| PG-13 | |
| PG | |
| Approved | 981 |
| Other values (21) |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 3.7382922 |
| Min length | 1 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PG |
|---|---|
| 2nd row | R |
| 3rd row | R |
| 4th row | R |
| 5th row | R |
Common Values
| Value | Count | Frequency (%) |
| R | 10099 | |
| Not Rated | 4518 | |
| PG-13 | 3780 | 11.2% |
| PG | 3473 | 10.3% |
| Approved | 981 | 2.9% |
| Unrated | 905 | 2.7% |
| G | 768 | 2.3% |
| TV-MA | 264 | 0.8% |
| TV-14 | 194 | 0.6% |
| X | 148 | 0.4% |
| Other values (16) | 494 | 1.5% |
| (Missing) | 7976 |
Length
| Value | Count | Frequency (%) |
| r | 10099 | |
| not | 4518 | |
| rated | 4518 | |
| pg-13 | 3780 | 12.5% |
| pg | 3473 | 11.5% |
| approved | 981 | 3.3% |
| unrated | 905 | 3.0% |
| g | 768 | 2.5% |
| tv-ma | 264 | 0.9% |
| tv-14 | 194 | 0.6% |
| Other values (17) | 642 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 14617 | |
| t | 9941 | |
| G | 8362 | 8.7% |
| P | 7551 | 7.9% |
| e | 6406 | 6.7% |
| d | 6406 | 6.7% |
| o | 5499 | 5.7% |
| a | 5425 | 5.7% |
| N | 4582 | 4.8% |
| 4518 | 4.7% | |
| Other values (24) | 22483 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 39415 | |
| Uppercase Letter | 39166 | |
| Decimal Number | 8129 | 8.5% |
| Space Separator | 4518 | 4.7% |
| Dash Punctuation | 4505 | 4.7% |
| Other Punctuation | 40 | < 0.1% |
| Math Symbol | 17 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 14617 | |
| G | 8362 | |
| P | 7551 | |
| N | 4582 | 11.7% |
| A | 1247 | 3.2% |
| U | 905 | 2.3% |
| V | 659 | 1.7% |
| T | 654 | 1.7% |
| M | 355 | 0.9% |
| X | 148 | 0.4% |
| Other values (4) | 86 | 0.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 9941 | |
| e | 6406 | |
| d | 6406 | |
| o | 5499 | |
| a | 5425 | |
| p | 1962 | 5.0% |
| r | 1886 | 4.8% |
| v | 981 | 2.5% |
| n | 905 | 2.3% |
| s | 4 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4058 | |
| 3 | 3786 | |
| 4 | 194 | 2.4% |
| 7 | 78 | 1.0% |
| 6 | 8 | 0.1% |
| 8 | 5 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 4518 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4505 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 40 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 17 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 78581 | |
| Common | 17209 | 18.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 14617 | |
| t | 9941 | |
| G | 8362 | |
| P | 7551 | |
| e | 6406 | |
| d | 6406 | |
| o | 5499 | 7.0% |
| a | 5425 | 6.9% |
| N | 4582 | 5.8% |
| p | 1962 | 2.5% |
| Other values (14) | 7830 |
Common
| Value | Count | Frequency (%) |
| 4518 | ||
| - | 4505 | |
| 1 | 4058 | |
| 3 | 3786 | |
| 4 | 194 | 1.1% |
| 7 | 78 | 0.5% |
| / | 40 | 0.2% |
| + | 17 | 0.1% |
| 6 | 8 | < 0.1% |
| 8 | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 95790 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 14617 | |
| t | 9941 | |
| G | 8362 | 8.7% |
| P | 7551 | 7.9% |
| e | 6406 | 6.7% |
| d | 6406 | 6.7% |
| o | 5499 | 5.7% |
| a | 5425 | 5.7% |
| N | 4582 | 4.8% |
| 4518 | 4.7% | |
| Other values (24) | 22483 |
Rating
Real number (ℝ)
| Distinct | 86 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 138 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.1551581 |
| Minimum | 1.1 |
|---|---|
| Maximum | 9.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 262.6 KiB |
Quantile statistics
| Minimum | 1.1 |
|---|---|
| 5-th percentile | 4.1 |
| Q1 | 5.5 |
| median | 6.3 |
| Q3 | 7 |
| 95-th percentile | 7.8 |
| Maximum | 9.6 |
| Range | 8.5 |
| Interquartile range (IQR) | 1.5 |
Descriptive statistics
| Standard deviation | 1.1460699 |
|---|---|
| Coefficient of variation (CV) | 0.18619666 |
| Kurtosis | 0.33014337 |
| Mean | 6.1551581 |
| Median Absolute Deviation (MAD) | 0.8 |
| Skewness | -0.56822661 |
| Sum | 205963.9 |
| Variance | 1.3134761 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.4 | 1274 | 3.8% |
| 6.6 | 1243 | 3.7% |
| 6.7 | 1237 | 3.7% |
| 6.5 | 1226 | 3.6% |
| 6.3 | 1219 | 3.6% |
| 6.2 | 1198 | 3.6% |
| 7.1 | 1177 | 3.5% |
| 6.1 | 1177 | 3.5% |
| 6.8 | 1141 | 3.4% |
| 7 | 1113 | 3.3% |
| Other values (76) | 21457 |
| Value | Count | Frequency (%) |
| 1.1 | 1 | < 0.1% |
| 1.2 | 2 | < 0.1% |
| 1.3 | 5 | |
| 1.4 | 2 | < 0.1% |
| 1.5 | 7 | |
| 1.6 | 7 | |
| 1.7 | 7 | |
| 1.8 | 7 | |
| 1.9 | 8 | |
| 2 | 5 |
| Value | Count | Frequency (%) |
| 9.6 | 2 | < 0.1% |
| 9.5 | 2 | < 0.1% |
| 9.4 | 4 | < 0.1% |
| 9.3 | 5 | < 0.1% |
| 9.2 | 9 | < 0.1% |
| 9.1 | 5 | < 0.1% |
| 9 | 10 | < 0.1% |
| 8.9 | 22 | |
| 8.8 | 31 | |
| 8.7 | 33 |
Votes
Text
| Distinct | 1758 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 138 |
| Missing (%) | 0.4% |
| Memory size | 262.6 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.2959178 |
| Min length | 1 |
Unique
| Unique | 218 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 683K |
|---|---|
| 2nd row | 173K |
| 3rd row | 1.1M |
| 4th row | 279K |
| 5th row | 15K |
| Value | Count | Frequency (%) |
| 1.1k | 577 | 1.7% |
| 1.2k | 451 | 1.3% |
| 1.3k | 443 | 1.3% |
| 1.4k | 412 | 1.2% |
| 1.5k | 373 | 1.1% |
| 1.6k | 365 | 1.1% |
| 11k | 364 | 1.1% |
| 1.7k | 346 | 1.0% |
| 1.9k | 320 | 1.0% |
| 1.8k | 317 | 0.9% |
| Other values (1748) | 29494 |
Most occurring characters
| Value | Count | Frequency (%) |
| K | 21150 | |
| 1 | 15050 | |
| . | 10847 | |
| 2 | 10794 | |
| 3 | 9104 | |
| 4 | 7871 | 7.1% |
| 5 | 7092 | 6.4% |
| 6 | 6827 | 6.2% |
| 7 | 6276 | 5.7% |
| 8 | 5966 | 5.4% |
| Other values (3) | 9311 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 78217 | |
| Uppercase Letter | 21224 | 19.2% |
| Other Punctuation | 10847 | 9.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 15050 | |
| 2 | 10794 | |
| 3 | 9104 | |
| 4 | 7871 | |
| 5 | 7092 | |
| 6 | 6827 | |
| 7 | 6276 | |
| 8 | 5966 | 7.6% |
| 9 | 5770 | 7.4% |
| 0 | 3467 | 4.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 21150 | |
| M | 74 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 10847 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 89064 | |
| Latin | 21224 | 19.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 15050 | |
| . | 10847 | |
| 2 | 10794 | |
| 3 | 9104 | |
| 4 | 7871 | |
| 5 | 7092 | |
| 6 | 6827 | |
| 7 | 6276 | |
| 8 | 5966 | 6.7% |
| 9 | 5770 | 6.5% |
Latin
| Value | Count | Frequency (%) |
| K | 21150 | |
| M | 74 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 110288 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| K | 21150 | |
| 1 | 15050 | |
| . | 10847 | |
| 2 | 10794 | |
| 3 | 9104 | |
| 4 | 7871 | 7.1% |
| 5 | 7092 | 6.4% |
| 6 | 6827 | 6.2% |
| 7 | 6276 | 5.7% |
| 8 | 5966 | 5.4% |
| Other values (3) | 9311 |
budget
Real number (ℝ)
High correlation  Missing  Skewed 
| Distinct | 1140 |
|---|---|
| Distinct (%) | 9.6% |
| Missing | 21785 |
| Missing (%) | 64.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84543197 |
| Minimum | 1 |
|---|---|
| Maximum | 3 × 1011 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 262.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 130000 |
| Q1 | 2000000 |
| median | 9000000 |
| Q3 | 27000000 |
| 95-th percentile | 1.15 × 108 |
| Maximum | 3 × 1011 |
| Range | 3 × 1011 |
| Interquartile range (IQR) | 25000000 |
Descriptive statistics
| Standard deviation | 2.866281 × 109 |
|---|---|
| Coefficient of variation (CV) | 33.903154 |
| Kurtosis | 10155.268 |
| Mean | 84543197 |
| Median Absolute Deviation (MAD) | 8193053 |
| Skewness | 97.636144 |
| Sum | 9.9887787 × 1011 |
| Variance | 8.215567 × 1018 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000000 | 357 | 1.1% |
| 5000000 | 350 | 1.0% |
| 20000000 | 342 | 1.0% |
| 3000000 | 309 | 0.9% |
| 15000000 | 304 | 0.9% |
| 1000000 | 277 | 0.8% |
| 30000000 | 272 | 0.8% |
| 25000000 | 272 | 0.8% |
| 2000000 | 269 | 0.8% |
| 4000000 | 239 | 0.7% |
| Other values (1130) | 8824 | |
| (Missing) | 21785 |
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 4 | 1 | |
| 10 | 1 | |
| 20 | 1 | |
| 100 | 1 | |
| 220 | 1 | |
| 230 | 1 | |
| 260 | 1 | |
| 300 | 2 | |
| 400 | 1 |
| Value | Count | Frequency (%) |
| 3 × 1011 | 1 | < 0.1% |
| 3.5 × 1010 | 1 | < 0.1% |
| 3 × 1010 | 3 | |
| 2.8 × 1010 | 1 | < 0.1% |
| 2.4 × 1010 | 1 | < 0.1% |
| 1.9 × 1010 | 1 | < 0.1% |
| 1.5 × 1010 | 2 | |
| 1.22155 × 1010 | 1 | < 0.1% |
| 1.2 × 1010 | 2 | |
| 1 × 1010 | 2 |
grossWorldWide
Real number (ℝ)
High correlation  Missing 
| Distinct | 18033 |
|---|---|
| Distinct (%) | 99.0% |
| Missing | 15378 |
| Missing (%) | 45.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38149613 |
| Minimum | 1 |
|---|---|
| Maximum | 2.923706 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 262.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 7790.85 |
| Q1 | 158993.75 |
| median | 2311544 |
| Q3 | 20929309 |
| 95-th percentile | 1.9391393 × 108 |
| Maximum | 2.923706 × 109 |
| Range | 2.923706 × 109 |
| Interquartile range (IQR) | 20770315 |
Descriptive statistics
| Standard deviation | 1.2101046 × 108 |
|---|---|
| Coefficient of variation (CV) | 3.1719971 |
| Kurtosis | 95.00837 |
| Mean | 38149613 |
| Median Absolute Deviation (MAD) | 2298895.5 |
| Skewness | 7.7779851 |
| Sum | 6.9516224 × 1011 |
| Variance | 1.4643531 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8148 | 13 | < 0.1% |
| 509 | 9 | < 0.1% |
| 97182 | 6 | < 0.1% |
| 14000000 | 4 | < 0.1% |
| 11000000 | 4 | < 0.1% |
| 3451 | 4 | < 0.1% |
| 18000 | 4 | < 0.1% |
| 1500000 | 3 | < 0.1% |
| 2970 | 3 | < 0.1% |
| 2735 | 3 | < 0.1% |
| Other values (18023) | 18169 | |
| (Missing) | 15378 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 9 | 1 | |
| 13 | 1 | |
| 15 | 1 | |
| 27 | 1 | |
| 32 | 1 | |
| 34 | 1 | |
| 42 | 2 | |
| 52 | 2 | |
| 54 | 1 |
| Value | Count | Frequency (%) |
| 2923706026 | 1 | |
| 2799439100 | 1 | |
| 2320250281 | 1 | |
| 2264750694 | 1 | |
| 2071310218 | 1 | |
| 2052415039 | 1 | |
| 1952723719 | 1 | |
| 1698772985 | 1 | |
| 1671537444 | 1 | |
| 1662020819 | 1 |
gross_US_Canada
Real number (ℝ)
High correlation  Missing 
| Distinct | 17211 |
|---|---|
| Distinct (%) | 98.0% |
| Missing | 16029 |
| Missing (%) | 47.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18082363 |
| Minimum | 64 |
|---|---|
| Maximum | 9.3666222 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 262.6 KiB |
Quantile statistics
| Minimum | 64 |
|---|---|
| 5-th percentile | 9192 |
| Q1 | 86036.5 |
| median | 909411 |
| Q3 | 14051372 |
| 95-th percentile | 92050430 |
| Maximum | 9.3666222 × 108 |
| Range | 9.3666216 × 108 |
| Interquartile range (IQR) | 13965336 |
Descriptive statistics
| Standard deviation | 48531806 |
|---|---|
| Coefficient of variation (CV) | 2.6839305 |
| Kurtosis | 62.143079 |
| Mean | 18082363 |
| Median Absolute Deviation (MAD) | 897342 |
| Skewness | 6.369239 |
| Sum | 3.1772521 × 1011 |
| Variance | 2.3553362 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8144 | 18 | 0.1% |
| 509 | 12 | < 0.1% |
| 5000000 | 9 | < 0.1% |
| 97182 | 9 | < 0.1% |
| 321875 | 7 | < 0.1% |
| 25000 | 6 | < 0.1% |
| 11000000 | 6 | < 0.1% |
| 22168 | 5 | < 0.1% |
| 2180000 | 5 | < 0.1% |
| 7500000 | 5 | < 0.1% |
| Other values (17201) | 17489 | |
| (Missing) | 16029 |
| Value | Count | Frequency (%) |
| 64 | 1 | |
| 80 | 1 | |
| 95 | 1 | |
| 153 | 1 | |
| 180 | 1 | |
| 211 | 1 | |
| 212 | 1 | |
| 256 | 1 | |
| 309 | 1 | |
| 347 | 1 |
| Value | Count | Frequency (%) |
| 936662225 | 1 | |
| 858373000 | 1 | |
| 814866759 | 1 | |
| 785221649 | 1 | |
| 718732821 | 1 | |
| 700426566 | 1 | |
| 684075767 | 1 | |
| 678815482 | 1 | |
| 674292608 | 1 | |
| 653406625 | 1 |
opening_weekend_Gross
Real number (ℝ)
High correlation  Missing 
| Distinct | 14751 |
|---|---|
| Distinct (%) | 95.0% |
| Missing | 18077 |
| Missing (%) | 53.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5110081.8 |
| Minimum | 11 |
|---|---|
| Maximum | 3.5711501 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 262.6 KiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 2787 |
| Q1 | 13996.5 |
| median | 107536 |
| Q3 | 3772558.5 |
| 95-th percentile | 25028754 |
| Maximum | 3.5711501 × 108 |
| Range | 3.57115 × 108 |
| Interquartile range (IQR) | 3758562 |
Descriptive statistics
| Standard deviation | 14883189 |
|---|---|
| Coefficient of variation (CV) | 2.9125148 |
| Kurtosis | 81.785598 |
| Mean | 5110081.8 |
| Median Absolute Deviation (MAD) | 104362 |
| Skewness | 7.2594433 |
| Sum | 7.93238 × 1010 |
| Variance | 2.2150931 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11623 | 18 | 0.1% |
| 11206 | 18 | 0.1% |
| 512000 | 10 | < 0.1% |
| 2482000 | 8 | < 0.1% |
| 11537 | 8 | < 0.1% |
| 6000 | 6 | < 0.1% |
| 7000 | 6 | < 0.1% |
| 6500 | 5 | < 0.1% |
| 15942 | 5 | < 0.1% |
| 8000 | 5 | < 0.1% |
| Other values (14741) | 15434 | |
| (Missing) | 18077 |
| Value | Count | Frequency (%) |
| 11 | 1 | |
| 30 | 1 | |
| 46 | 1 | |
| 80 | 1 | |
| 89 | 1 | |
| 92 | 1 | |
| 95 | 1 | |
| 107 | 1 | |
| 112 | 1 | |
| 141 | 1 |
| Value | Count | Frequency (%) |
| 357115007 | 1 | |
| 260138569 | 1 | |
| 257698183 | 1 | |
| 247966675 | 1 | |
| 220009584 | 1 | |
| 211435291 | 1 | |
| 208806270 | 1 | |
| 207438708 | 1 | |
| 202003951 | 1 | |
| 191770759 | 1 |
directors
Text
Missing 
| Distinct | 14520 |
|---|---|
| Distinct (%) | 43.7% |
| Missing | 359 |
| Missing (%) | 1.1% |
| Memory size | 262.6 KiB |
Length
| Max length | 85 |
|---|---|
| Median length | 75 |
| Mean length | 19.012184 |
| Min length | 6 |
Unique
| Unique | 9335 ? |
|---|---|
| Unique (%) | 28.1% |
Sample
| 1st row | ['Steven Spielberg'] |
|---|---|
| 2nd row | ['Jim Sharman'] |
| 3rd row | ['Milos Forman'] |
| 4th row | ['Sidney Lumet'] |
| 5th row | ['Hal Ashby'] |
| Value | Count | Frequency (%) |
| john | 893 | 1.2% |
| michael | 674 | 0.9% |
| david | 648 | 0.9% |
| robert | 625 | 0.8% |
| peter | 489 | 0.6% |
| richard | 424 | 0.6% |
| james | 378 | 0.5% |
| paul | 353 | 0.5% |
| de | 271 | 0.4% |
| lee | 265 | 0.3% |
| Other values (14951) | 71174 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 72226 | 11.4% |
| a | 43597 | 6.9% |
| 42953 | 6.8% | |
| e | 41588 | 6.6% |
| r | 33476 | 5.3% |
| [ | 33241 | 5.3% |
| ] | 33241 | 5.3% |
| n | 32396 | 5.1% |
| i | 31625 | 5.0% |
| o | 29137 | 4.6% |
| Other values (91) | 238504 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 365928 | |
| Other Punctuation | 77969 | 12.3% |
| Uppercase Letter | 77658 | 12.3% |
| Space Separator | 42953 | 6.8% |
| Open Punctuation | 33241 | 5.3% |
| Close Punctuation | 33241 | 5.3% |
| Dash Punctuation | 993 | 0.2% |
| Decimal Number | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 43597 | |
| e | 41588 | |
| r | 33476 | 9.1% |
| n | 32396 | 8.9% |
| i | 31625 | 8.6% |
| o | 29137 | 8.0% |
| l | 22035 | 6.0% |
| s | 17188 | 4.7% |
| t | 15757 | 4.3% |
| h | 14010 | 3.8% |
| Other values (43) | 85119 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 6629 | 8.5% |
| S | 6572 | 8.5% |
| J | 5920 | 7.6% |
| R | 5300 | 6.8% |
| C | 4971 | 6.4% |
| B | 4912 | 6.3% |
| A | 4861 | 6.3% |
| D | 4250 | 5.5% |
| G | 3891 | 5.0% |
| P | 3767 | 4.9% |
| Other values (29) | 26585 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 72226 | |
| , | 2978 | 3.8% |
| . | 2297 | 2.9% |
| " | 468 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| 42953 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 33241 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 33241 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 993 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 443586 | |
| Common | 188398 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 43597 | 9.8% |
| e | 41588 | 9.4% |
| r | 33476 | 7.5% |
| n | 32396 | 7.3% |
| i | 31625 | 7.1% |
| o | 29137 | 6.6% |
| l | 22035 | 5.0% |
| s | 17188 | 3.9% |
| t | 15757 | 3.6% |
| h | 14010 | 3.2% |
| Other values (82) | 162777 |
Common
| Value | Count | Frequency (%) |
| ' | 72226 | |
| 42953 | ||
| [ | 33241 | |
| ] | 33241 | |
| , | 2978 | 1.6% |
| . | 2297 | 1.2% |
| - | 993 | 0.5% |
| " | 468 | 0.2% |
| 9 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 629207 | |
| None | 2777 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 72226 | 11.5% |
| a | 43597 | 6.9% |
| 42953 | 6.8% | |
| e | 41588 | 6.6% |
| r | 33476 | 5.3% |
| [ | 33241 | 5.3% |
| ] | 33241 | 5.3% |
| n | 32396 | 5.1% |
| i | 31625 | 5.0% |
| o | 29137 | 4.6% |
| Other values (51) | 235727 |
None
| Value | Count | Frequency (%) |
| é | 696 | |
| á | 306 | |
| ô | 300 | |
| ó | 226 | 8.1% |
| í | 212 | 7.6% |
| ö | 151 | 5.4% |
| ú | 120 | 4.3% |
| ç | 85 | 3.1% |
| ü | 84 | 3.0% |
| è | 70 | 2.5% |
| Other values (30) | 527 |
writers
Text
Missing 
| Distinct | 27123 |
|---|---|
| Distinct (%) | 84.7% |
| Missing | 1576 |
| Missing (%) | 4.7% |
| Memory size | 262.6 KiB |
Length
| Max length | 86 |
|---|---|
| Median length | 68 |
| Mean length | 34.883556 |
| Min length | 7 |
Unique
| Unique | 24654 ? |
|---|---|
| Unique (%) | 77.0% |
Sample
| 1st row | ['Peter Benchley', 'Carl Gottlieb'] |
|---|---|
| 2nd row | ["Richard O'Brien", 'Jim Sharman'] |
| 3rd row | ['Lawrence Hauben', 'Bo Goldman', 'Ken Kesey'] |
| 4th row | ['Frank Pierson', 'P.F. Kluge', 'Thomas Moore'] |
| 5th row | ['Robert Towne', 'Warren Beatty'] |
| Value | Count | Frequency (%) |
| john | 1441 | 1.1% |
| david | 1273 | 0.9% |
| michael | 1047 | 0.8% |
| robert | 986 | 0.7% |
| james | 702 | 0.5% |
| peter | 622 | 0.5% |
| paul | 621 | 0.5% |
| de | 586 | 0.4% |
| william | 575 | 0.4% |
| richard | 570 | 0.4% |
| Other values (26171) | 126263 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 126685 | 11.3% |
| 102662 | 9.2% | |
| a | 79105 | 7.1% |
| e | 73163 | 6.5% |
| r | 59068 | 5.3% |
| n | 57599 | 5.2% |
| i | 56030 | 5.0% |
| o | 50916 | 4.6% |
| l | 39304 | 3.5% |
| [ | 32024 | 2.9% |
| Other values (99) | 440555 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 647019 | |
| Other Punctuation | 164008 | 14.7% |
| Uppercase Letter | 137709 | 12.3% |
| Space Separator | 102662 | 9.2% |
| Open Punctuation | 32024 | 2.9% |
| Close Punctuation | 32024 | 2.9% |
| Dash Punctuation | 1662 | 0.1% |
| Decimal Number | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 79105 | |
| e | 73163 | |
| r | 59068 | 9.1% |
| n | 57599 | 8.9% |
| i | 56030 | 8.7% |
| o | 50916 | 7.9% |
| l | 39304 | 6.1% |
| s | 30019 | 4.6% |
| t | 28029 | 4.3% |
| h | 24295 | 3.8% |
| Other values (47) | 149491 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 11764 | 8.5% |
| S | 11556 | 8.4% |
| J | 10279 | 7.5% |
| B | 9074 | 6.6% |
| C | 8909 | 6.5% |
| R | 8607 | 6.3% |
| A | 8597 | 6.2% |
| D | 7632 | 5.5% |
| G | 7154 | 5.2% |
| L | 6550 | 4.8% |
| Other values (31) | 47587 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 126685 | |
| , | 31547 | 19.2% |
| . | 4778 | 2.9% |
| " | 998 | 0.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 1 | |
| 5 | 1 | |
| 3 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 102662 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 32024 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 32024 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1662 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 784728 | |
| Common | 332383 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 79105 | 10.1% |
| e | 73163 | 9.3% |
| r | 59068 | 7.5% |
| n | 57599 | 7.3% |
| i | 56030 | 7.1% |
| o | 50916 | 6.5% |
| l | 39304 | 5.0% |
| s | 30019 | 3.8% |
| t | 28029 | 3.6% |
| h | 24295 | 3.1% |
| Other values (88) | 287200 |
Common
| Value | Count | Frequency (%) |
| ' | 126685 | |
| 102662 | ||
| [ | 32024 | 9.6% |
| ] | 32024 | 9.6% |
| , | 31547 | 9.5% |
| . | 4778 | 1.4% |
| - | 1662 | 0.5% |
| " | 998 | 0.3% |
| 8 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1112313 | |
| None | 4798 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 126685 | 11.4% |
| 102662 | 9.2% | |
| a | 79105 | 7.1% |
| e | 73163 | 6.6% |
| r | 59068 | 5.3% |
| n | 57599 | 5.2% |
| i | 56030 | 5.0% |
| o | 50916 | 4.6% |
| l | 39304 | 3.5% |
| [ | 32024 | 2.9% |
| Other values (53) | 435757 |
None
| Value | Count | Frequency (%) |
| é | 1264 | |
| á | 578 | |
| ô | 470 | 9.8% |
| í | 403 | 8.4% |
| ó | 348 | 7.3% |
| ü | 214 | 4.5% |
| ö | 206 | 4.3% |
| è | 200 | 4.2% |
| ç | 146 | 3.0% |
| ú | 145 | 3.0% |
| Other values (36) | 824 |
stars
Text
Missing 
| Distinct | 32812 |
|---|---|
| Distinct (%) | 99.0% |
| Missing | 473 |
| Missing (%) | 1.4% |
| Memory size | 262.6 KiB |
Length
| Max length | 136 |
|---|---|
| Median length | 82 |
| Mean length | 51.245238 |
| Min length | 6 |
Unique
| Unique | 32579 ? |
|---|---|
| Unique (%) | 98.3% |
Sample
| 1st row | ['Roy Scheider', 'Robert Shaw', 'Richard Dreyfuss'] |
|---|---|
| 2nd row | ['Tim Curry', 'Susan Sarandon', 'Barry Bostwick'] |
| 3rd row | ['Jack Nicholson', 'Louise Fletcher', 'Michael Berryman'] |
| 4th row | ['Al Pacino', 'John Cazale', 'Penelope Allen'] |
| 5th row | ['Warren Beatty', 'Julie Christie', 'Goldie Hawn'] |
| Value | Count | Frequency (%) |
| john | 1565 | 0.8% |
| michael | 1281 | 0.6% |
| robert | 1032 | 0.5% |
| james | 945 | 0.5% |
| david | 921 | 0.5% |
| richard | 817 | 0.4% |
| peter | 775 | 0.4% |
| lee | 654 | 0.3% |
| paul | 590 | 0.3% |
| de | 539 | 0.3% |
| Other values (34124) | 193686 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 197037 | 11.6% |
| 169678 | 10.0% | |
| a | 127567 | 7.5% |
| e | 115659 | 6.8% |
| n | 93235 | 5.5% |
| i | 85608 | 5.0% |
| r | 84999 | 5.0% |
| o | 71937 | 4.2% |
| , | 65762 | 3.9% |
| l | 61100 | 3.6% |
| Other values (105) | 625019 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 985861 | |
| Other Punctuation | 266472 | 15.7% |
| Uppercase Letter | 206795 | 12.2% |
| Space Separator | 169678 | 10.0% |
| Open Punctuation | 33127 | 2.0% |
| Close Punctuation | 33127 | 2.0% |
| Dash Punctuation | 2478 | 0.1% |
| Decimal Number | 63 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 127567 | |
| e | 115659 | |
| n | 93235 | |
| i | 85608 | 8.7% |
| r | 84999 | 8.6% |
| o | 71937 | 7.3% |
| l | 61100 | 6.2% |
| s | 44212 | 4.5% |
| t | 43929 | 4.5% |
| h | 36699 | 3.7% |
| Other values (48) | 220916 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 18510 | 9.0% |
| S | 16991 | 8.2% |
| B | 15023 | 7.3% |
| C | 14657 | 7.1% |
| J | 14555 | 7.0% |
| A | 13432 | 6.5% |
| R | 12103 | 5.9% |
| D | 11642 | 5.6% |
| L | 10476 | 5.1% |
| K | 9763 | 4.7% |
| Other values (29) | 69643 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 197037 | |
| , | 65762 | 24.7% |
| . | 1900 | 0.7% |
| " | 1766 | 0.7% |
| ! | 5 | < 0.1% |
| / | 1 | < 0.1% |
| & | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 32 | |
| 3 | 11 | 17.5% |
| 5 | 9 | 14.3% |
| 2 | 4 | 6.3% |
| 4 | 3 | 4.8% |
| 1 | 2 | 3.2% |
| 7 | 2 | 3.2% |
Space Separator
| Value | Count | Frequency (%) |
| 169678 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 33127 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 33127 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2478 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1192656 | |
| Common | 504945 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 127567 | 10.7% |
| e | 115659 | 9.7% |
| n | 93235 | 7.8% |
| i | 85608 | 7.2% |
| r | 84999 | 7.1% |
| o | 71937 | 6.0% |
| l | 61100 | 5.1% |
| s | 44212 | 3.7% |
| t | 43929 | 3.7% |
| h | 36699 | 3.1% |
| Other values (87) | 427711 |
Common
| Value | Count | Frequency (%) |
| ' | 197037 | |
| 169678 | ||
| , | 65762 | 13.0% |
| [ | 33127 | 6.6% |
| ] | 33127 | 6.6% |
| - | 2478 | 0.5% |
| . | 1900 | 0.4% |
| " | 1766 | 0.3% |
| 0 | 32 | < 0.1% |
| 3 | 11 | < 0.1% |
| Other values (8) | 27 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1690729 | |
| None | 6872 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 197037 | 11.7% |
| 169678 | 10.0% | |
| a | 127567 | 7.5% |
| e | 115659 | 6.8% |
| n | 93235 | 5.5% |
| i | 85608 | 5.1% |
| r | 84999 | 5.0% |
| o | 71937 | 4.3% |
| , | 65762 | 3.9% |
| l | 61100 | 3.6% |
| Other values (60) | 618147 |
None
| Value | Count | Frequency (%) |
| é | 1735 | |
| á | 861 | |
| ô | 695 | |
| í | 587 | 8.5% |
| ü | 361 | 5.3% |
| è | 358 | 5.2% |
| ó | 340 | 4.9% |
| ö | 324 | 4.7% |
| ç | 215 | 3.1% |
| ø | 180 | 2.6% |
| Other values (35) | 1216 |
genres
Text
Missing 
| Distinct | 8540 |
|---|---|
| Distinct (%) | 25.7% |
| Missing | 382 |
| Missing (%) | 1.1% |
| Memory size | 262.6 KiB |
Length
| Max length | 196 |
|---|---|
| Median length | 169 |
| Mean length | 36.094888 |
| Min length | 7 |
Unique
| Unique | 6502 ? |
|---|---|
| Unique (%) | 19.6% |
Sample
| 1st row | ['Monster Horror', 'Sea Adventure', 'Survival', 'Adventure', 'Drama', 'Horror', 'Thriller'] |
|---|---|
| 2nd row | ['Dark Comedy', 'Raunchy Comedy', 'Rock Musical', 'Supernatural Horror', 'Comedy', 'Horror', 'Musical'] |
| 3rd row | ['Medical Drama', 'Psychological Drama', 'Drama'] |
| 4th row | ['Heist', 'True Crime', 'Biography', 'Crime', 'Drama', 'Thriller'] |
| 5th row | ['Satire', 'Comedy', 'Drama'] |
| Value | Count | Frequency (%) |
| drama | 20235 | |
| comedy | 14079 | 11.8% |
| thriller | 7805 | 6.6% |
| romance | 7047 | 5.9% |
| horror | 6096 | 5.1% |
| action | 5838 | 4.9% |
| crime | 5552 | 4.7% |
| adventure | 4825 | 4.1% |
| fantasy | 3101 | 2.6% |
| mystery | 2981 | 2.5% |
| Other values (174) | 41436 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 207946 | |
| r | 90172 | 7.5% |
| 85777 | 7.2% | |
| a | 79147 | 6.6% |
| , | 70755 | 5.9% |
| e | 64753 | 5.4% |
| o | 57734 | 4.8% |
| m | 56082 | 4.7% |
| i | 46611 | 3.9% |
| y | 35514 | 3.0% |
| Other values (47) | 404509 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 640803 | |
| Other Punctuation | 278893 | |
| Uppercase Letter | 122764 | 10.2% |
| Space Separator | 85777 | 7.2% |
| Open Punctuation | 33218 | 2.8% |
| Close Punctuation | 33218 | 2.8% |
| Dash Punctuation | 4327 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 90172 | |
| a | 79147 | |
| e | 64753 | |
| o | 57734 | |
| m | 56082 | |
| i | 46611 | 7.3% |
| y | 35514 | 5.5% |
| n | 34248 | 5.3% |
| t | 31881 | 5.0% |
| c | 27426 | 4.3% |
| Other values (16) | 117235 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 25639 | |
| C | 21206 | |
| A | 13963 | |
| T | 9830 | 8.0% |
| F | 9048 | 7.4% |
| S | 8923 | 7.3% |
| H | 8461 | 6.9% |
| R | 7786 | 6.3% |
| M | 6089 | 5.0% |
| W | 3111 | 2.5% |
| Other values (14) | 8708 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 207946 | |
| , | 70755 | 25.4% |
| & | 192 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 85777 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 33218 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 33218 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4327 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 763567 | |
| Common | 435433 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 90172 | 11.8% |
| a | 79147 | 10.4% |
| e | 64753 | 8.5% |
| o | 57734 | 7.6% |
| m | 56082 | 7.3% |
| i | 46611 | 6.1% |
| y | 35514 | 4.7% |
| n | 34248 | 4.5% |
| t | 31881 | 4.2% |
| c | 27426 | 3.6% |
| Other values (40) | 239999 |
Common
| Value | Count | Frequency (%) |
| ' | 207946 | |
| 85777 | ||
| , | 70755 | 16.2% |
| [ | 33218 | 7.6% |
| ] | 33218 | 7.6% |
| - | 4327 | 1.0% |
| & | 192 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1198965 | |
| None | 35 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 207946 | |
| r | 90172 | 7.5% |
| 85777 | 7.2% | |
| a | 79147 | 6.6% |
| , | 70755 | 5.9% |
| e | 64753 | 5.4% |
| o | 57734 | 4.8% |
| m | 56082 | 4.7% |
| i | 46611 | 3.9% |
| y | 35514 | 3.0% |
| Other values (46) | 404474 |
None
| Value | Count | Frequency (%) |
| ō | 35 |
countries_origin
Text
Missing 
| Distinct | 2938 |
|---|---|
| Distinct (%) | 8.8% |
| Missing | 366 |
| Missing (%) | 1.1% |
| Memory size | 262.6 KiB |
Length
| Max length | 187 |
|---|---|
| Median length | 168 |
| Mean length | 19.947704 |
| Min length | 8 |
Unique
| Unique | 2249 ? |
|---|---|
| Unique (%) | 6.8% |
Sample
| 1st row | ['United States'] |
|---|---|
| 2nd row | ['United Kingdom', 'United States'] |
| 3rd row | ['United States'] |
| 4th row | ['United States'] |
| 5th row | ['United States'] |
| Value | Count | Frequency (%) |
| united | 22733 | |
| states | 18250 | |
| kingdom | 4437 | 6.0% |
| france | 3904 | 5.3% |
| italy | 2947 | 4.0% |
| germany | 2302 | 3.1% |
| canada | 1811 | 2.5% |
| india | 1771 | 2.4% |
| japan | 1445 | 2.0% |
| spain | 1125 | 1.5% |
| Other values (170) | 12789 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 95785 | |
| t | 65886 | 9.9% |
| e | 55217 | 8.3% |
| a | 46919 | 7.1% |
| n | 46031 | 6.9% |
| 40280 | 6.1% | |
| i | 36352 | 5.5% |
| [ | 33234 | 5.0% |
| ] | 33234 | 5.0% |
| d | 32786 | 4.9% |
| Other values (47) | 177218 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 372271 | |
| Other Punctuation | 110451 | 16.7% |
| Uppercase Letter | 73472 | 11.1% |
| Space Separator | 40280 | 6.1% |
| Open Punctuation | 33234 | 5.0% |
| Close Punctuation | 33234 | 5.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 65886 | |
| e | 55217 | |
| a | 46919 | |
| n | 46031 | |
| i | 36352 | |
| d | 32786 | |
| s | 21582 | 5.8% |
| r | 10946 | 2.9% |
| o | 9518 | 2.6% |
| m | 8034 | 2.2% |
| Other values (16) | 39000 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 22985 | |
| S | 20933 | |
| K | 5496 | 7.5% |
| I | 5359 | 7.3% |
| F | 4053 | 5.5% |
| C | 2766 | 3.8% |
| G | 2547 | 3.5% |
| J | 1467 | 2.0% |
| A | 1178 | 1.6% |
| W | 949 | 1.3% |
| Other values (15) | 5739 | 7.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 95785 | |
| , | 14660 | 13.3% |
| " | 6 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 40280 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 33234 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 33234 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 445743 | |
| Common | 217199 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 65886 | |
| e | 55217 | |
| a | 46919 | |
| n | 46031 | |
| i | 36352 | |
| d | 32786 | 7.4% |
| U | 22985 | 5.2% |
| s | 21582 | 4.8% |
| S | 20933 | 4.7% |
| r | 10946 | 2.5% |
| Other values (41) | 86106 |
Common
| Value | Count | Frequency (%) |
| ' | 95785 | |
| 40280 | ||
| [ | 33234 | 15.3% |
| ] | 33234 | 15.3% |
| , | 14660 | 6.7% |
| " | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 662939 | |
| None | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 95785 | |
| t | 65886 | 9.9% |
| e | 55217 | 8.3% |
| a | 46919 | 7.1% |
| n | 46031 | 6.9% |
| 40280 | 6.1% | |
| i | 36352 | 5.5% |
| [ | 33234 | 5.0% |
| ] | 33234 | 5.0% |
| d | 32786 | 4.9% |
| Other values (46) | 177215 |
None
| Value | Count | Frequency (%) |
| ô | 3 |
Missing 
| Distinct | 12383 |
|---|---|
| Distinct (%) | 46.1% |
| Missing | 6729 |
| Missing (%) | 20.0% |
| Memory size | 262.6 KiB |
Length
| Max length | 163 |
|---|---|
| Median length | 112 |
| Mean length | 39.55491 |
| Min length | 6 |
Unique
| Unique | 9827 ? |
|---|---|
| Unique (%) | 36.6% |
Sample
| 1st row | ["Water Street, Edgartown, Martha's Vineyard, Massachusetts, USA"] |
|---|---|
| 2nd row | ['Oakley Court, Windsor Road, Oakley Green, Windsor, Berkshire, England, UK'] |
| 3rd row | ['Oregon State Mental Hospital - 2600 Center Street NE, Salem, Oregon, USA'] |
| 4th row | ['285 Prospect Park West, Brooklyn, New York City, New York, USA'] |
| 5th row | ['2270 Bowmont Drive, Beverly Hills, California, USA'] |
| Value | Count | Frequency (%) |
| usa | 11760 | 8.7% |
| california | 4613 | 3.4% |
| new | 4164 | 3.1% |
| york | 3213 | 2.4% |
| 2495 | 1.8% | |
| city | 2186 | 1.6% |
| uk | 2172 | 1.6% |
| los | 2039 | 1.5% |
| angeles | 2016 | 1.5% |
| england | 1905 | 1.4% |
| Other values (14616) | 98654 |
Most occurring characters
| Value | Count | Frequency (%) |
| 108346 | 10.2% | |
| a | 82890 | 7.8% |
| , | 60883 | 5.7% |
| e | 60598 | 5.7% |
| n | 56562 | 5.3% |
| ' | 53177 | 5.0% |
| i | 53144 | 5.0% |
| o | 52640 | 5.0% |
| r | 47842 | 4.5% |
| l | 39802 | 3.7% |
| Other values (109) | 446996 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 612203 | |
| Uppercase Letter | 154361 | 14.5% |
| Other Punctuation | 116825 | 11.0% |
| Space Separator | 108346 | 10.2% |
| Close Punctuation | 26873 | 2.5% |
| Open Punctuation | 26873 | 2.5% |
| Decimal Number | 12997 | 1.2% |
| Dash Punctuation | 4399 | 0.4% |
| Modifier Symbol | 2 | < 0.1% |
| Other Number | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 82890 | |
| e | 60598 | |
| n | 56562 | |
| i | 53144 | |
| o | 52640 | |
| r | 47842 | 7.8% |
| l | 39802 | 6.5% |
| t | 35482 | 5.8% |
| s | 30695 | 5.0% |
| d | 21737 | 3.6% |
| Other values (46) | 130811 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 23137 | |
| A | 19766 | |
| C | 16423 | 10.6% |
| U | 14935 | 9.7% |
| M | 7482 | 4.8% |
| B | 6973 | 4.5% |
| N | 6626 | 4.3% |
| L | 6457 | 4.2% |
| P | 5834 | 3.8% |
| I | 4360 | 2.8% |
| Other values (26) | 42368 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2705 | |
| 1 | 2506 | |
| 2 | 1598 | |
| 5 | 1284 | |
| 3 | 1113 | |
| 4 | 999 | 7.7% |
| 6 | 806 | 6.2% |
| 7 | 737 | 5.7% |
| 8 | 638 | 4.9% |
| 9 | 611 | 4.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 60883 | |
| ' | 53177 | |
| . | 1262 | 1.1% |
| " | 1214 | 1.0% |
| & | 220 | 0.2% |
| / | 58 | < 0.1% |
| # | 5 | < 0.1% |
| \ | 4 | < 0.1% |
| ; | 2 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 26872 | |
| ) | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 26872 | |
| ( | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 108346 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4399 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 2 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 766564 | |
| Common | 296316 | 27.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 82890 | 10.8% |
| e | 60598 | 7.9% |
| n | 56562 | 7.4% |
| i | 53144 | 6.9% |
| o | 52640 | 6.9% |
| r | 47842 | 6.2% |
| l | 39802 | 5.2% |
| t | 35482 | 4.6% |
| s | 30695 | 4.0% |
| S | 23137 | 3.0% |
| Other values (82) | 283772 |
Common
| Value | Count | Frequency (%) |
| 108346 | ||
| , | 60883 | |
| ' | 53177 | |
| ] | 26872 | 9.1% |
| [ | 26872 | 9.1% |
| - | 4399 | 1.5% |
| 0 | 2705 | 0.9% |
| 1 | 2506 | 0.8% |
| 2 | 1598 | 0.5% |
| 5 | 1284 | 0.4% |
| Other values (17) | 7674 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1059862 | |
| None | 3018 | 0.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 108346 | 10.2% | |
| a | 82890 | 7.8% |
| , | 60883 | 5.7% |
| e | 60598 | 5.7% |
| n | 56562 | 5.3% |
| ' | 53177 | 5.0% |
| i | 53144 | 5.0% |
| o | 52640 | 5.0% |
| r | 47842 | 4.5% |
| l | 39802 | 3.8% |
| Other values (68) | 443978 |
None
| Value | Count | Frequency (%) |
| é | 810 | |
| í | 394 | |
| ä | 268 | 8.9% |
| á | 179 | 5.9% |
| ó | 177 | 5.9% |
| à | 170 | 5.6% |
| ô | 147 | 4.9% |
| è | 145 | 4.8% |
| ö | 115 | 3.8% |
| æ | 85 | 2.8% |
| Other values (31) | 528 |
Missing 
| Distinct | 25940 |
|---|---|
| Distinct (%) | 80.5% |
| Missing | 1378 |
| Missing (%) | 4.1% |
| Memory size | 262.6 KiB |
Length
| Max length | 192 |
|---|---|
| Median length | 123 |
| Mean length | 47.035411 |
| Min length | 6 |
Unique
| Unique | 23844 ? |
|---|---|
| Unique (%) | 74.0% |
Sample
| 1st row | ['Zanuck/Brown Productions', 'Universal Pictures'] |
|---|---|
| 2nd row | ['Twentieth Century Fox', 'Michael White Productions'] |
| 3rd row | ['Fantasy Films', 'N.V. Zvaluw'] |
| 4th row | ['Warner Bros.', 'Artists Entertainment Complex'] |
| 5th row | ['Persky-Bright / Vista', 'Columbia Pictures', 'Rubeeker Films'] |
| Value | Count | Frequency (%) |
| productions | 10018 | 6.0% |
| films | 9926 | 5.9% |
| pictures | 7651 | 4.6% |
| film | 6423 | 3.8% |
| entertainment | 4998 | 3.0% |
| company | 1726 | 1.0% |
| international | 1372 | 0.8% |
| the | 1272 | 0.8% |
| media | 1163 | 0.7% |
| production | 1023 | 0.6% |
| Other values (19809) | 122195 |
Most occurring characters
| Value | Count | Frequency (%) |
| 135545 | 8.9% | |
| ' | 133900 | 8.8% |
| i | 103135 | 6.8% |
| e | 88556 | 5.8% |
| n | 86292 | 5.7% |
| o | 79711 | 5.3% |
| t | 78393 | 5.2% |
| r | 76800 | 5.1% |
| a | 74289 | 4.9% |
| s | 60936 | 4.0% |
| Other values (111) | 598018 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 937979 | |
| Uppercase Letter | 184884 | 12.2% |
| Other Punctuation | 178523 | 11.8% |
| Space Separator | 135545 | 8.9% |
| Open Punctuation | 35821 | 2.4% |
| Close Punctuation | 35821 | 2.4% |
| Decimal Number | 3822 | 0.3% |
| Dash Punctuation | 2821 | 0.2% |
| Math Symbol | 344 | < 0.1% |
| Other Symbol | 6 | < 0.1% |
| Other values (2) | 9 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 103135 | |
| e | 88556 | |
| n | 86292 | |
| o | 79711 | |
| t | 78393 | 8.4% |
| r | 76800 | 8.2% |
| a | 74289 | 7.9% |
| s | 60936 | 6.5% |
| l | 48788 | 5.2% |
| m | 43624 | 4.7% |
| Other values (43) | 197455 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 27120 | |
| F | 25931 | |
| C | 19227 | 10.4% |
| S | 11508 | 6.2% |
| M | 10302 | 5.6% |
| A | 9867 | 5.3% |
| E | 9345 | 5.1% |
| B | 8051 | 4.4% |
| T | 7606 | 4.1% |
| I | 7415 | 4.0% |
| Other values (26) | 48512 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 133900 | |
| , | 35131 | 19.7% |
| . | 6571 | 3.7% |
| " | 1332 | 0.7% |
| & | 814 | 0.5% |
| / | 721 | 0.4% |
| ! | 35 | < 0.1% |
| @ | 7 | < 0.1% |
| : | 7 | < 0.1% |
| ? | 2 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 891 | |
| 0 | 614 | |
| 1 | 596 | |
| 3 | 471 | |
| 4 | 400 | |
| 9 | 188 | 4.9% |
| 7 | 187 | 4.9% |
| 8 | 179 | 4.7% |
| 5 | 157 | 4.1% |
| 6 | 139 | 3.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 32241 | |
| ( | 3580 | 10.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 32241 | |
| ) | 3580 | 10.0% |
Space Separator
| Value | Count | Frequency (%) |
| 135545 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2821 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 344 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 6 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5 |
Other Number
| Value | Count | Frequency (%) |
| ² | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1122863 | |
| Common | 392712 | 25.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 103135 | 9.2% |
| e | 88556 | 7.9% |
| n | 86292 | 7.7% |
| o | 79711 | 7.1% |
| t | 78393 | 7.0% |
| r | 76800 | 6.8% |
| a | 74289 | 6.6% |
| s | 60936 | 5.4% |
| l | 48788 | 4.3% |
| m | 43624 | 3.9% |
| Other values (79) | 382339 |
Common
| Value | Count | Frequency (%) |
| 135545 | ||
| ' | 133900 | |
| , | 35131 | 8.9% |
| [ | 32241 | 8.2% |
| ] | 32241 | 8.2% |
| . | 6571 | 1.7% |
| ( | 3580 | 0.9% |
| ) | 3580 | 0.9% |
| - | 2821 | 0.7% |
| " | 1332 | 0.3% |
| Other values (22) | 5770 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1510769 | |
| None | 4806 | 0.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 135545 | 9.0% | |
| ' | 133900 | 8.9% |
| i | 103135 | 6.8% |
| e | 88556 | 5.9% |
| n | 86292 | 5.7% |
| o | 79711 | 5.3% |
| t | 78393 | 5.2% |
| r | 76800 | 5.1% |
| a | 74289 | 4.9% |
| s | 60936 | 4.0% |
| Other values (72) | 593212 |
None
| Value | Count | Frequency (%) |
| é | 2664 | |
| á | 553 | 11.5% |
| ó | 321 | 6.7% |
| í | 173 | 3.6% |
| ç | 169 | 3.5% |
| ü | 155 | 3.2% |
| É | 91 | 1.9% |
| è | 84 | 1.7% |
| ú | 73 | 1.5% |
| ñ | 72 | 1.5% |
| Other values (29) | 451 | 9.4% |
Languages
Text
Missing 
| Distinct | 2709 |
|---|---|
| Distinct (%) | 8.2% |
| Missing | 474 |
| Missing (%) | 1.4% |
| Memory size | 262.6 KiB |
Length
| Max length | 203 |
|---|---|
| Median length | 11 |
| Mean length | 15.615498 |
| Min length | 7 |
Unique
| Unique | 2042 ? |
|---|---|
| Unique (%) | 6.2% |
Sample
| 1st row | ['English'] |
|---|---|
| 2nd row | ['English'] |
| 3rd row | ['English'] |
| 4th row | ['English'] |
| 5th row | ['English'] |
| Value | Count | Frequency (%) |
| english | 23409 | |
| french | 3786 | 7.9% |
| spanish | 2832 | 5.9% |
| italian | 2780 | 5.8% |
| german | 2055 | 4.3% |
| japanese | 1494 | 3.1% |
| hindi | 1361 | 2.9% |
| russian | 857 | 1.8% |
| mandarin | 783 | 1.6% |
| cantonese | 539 | 1.1% |
| Other values (241) | 7850 | 16.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 94622 | |
| n | 45793 | 8.9% |
| i | 39194 | 7.6% |
| [ | 33126 | 6.4% |
| ] | 33126 | 6.4% |
| s | 32549 | 6.3% |
| h | 32319 | 6.2% |
| l | 27685 | 5.4% |
| g | 25077 | 4.8% |
| E | 23446 | 4.5% |
| Other values (54) | 130342 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 279495 | |
| Other Punctuation | 108826 | 21.0% |
| Uppercase Letter | 47838 | 9.2% |
| Open Punctuation | 33149 | 6.4% |
| Close Punctuation | 33149 | 6.4% |
| Space Separator | 14620 | 2.8% |
| Dash Punctuation | 138 | < 0.1% |
| Decimal Number | 64 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 45793 | |
| i | 39194 | |
| s | 32549 | |
| h | 32319 | |
| l | 27685 | |
| g | 25077 | |
| a | 22397 | |
| e | 14772 | 5.3% |
| r | 10260 | 3.7% |
| c | 4932 | 1.8% |
| Other values (16) | 24517 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 23446 | |
| F | 4045 | 8.5% |
| S | 3727 | 7.8% |
| I | 2954 | 6.2% |
| G | 2449 | 5.1% |
| H | 1909 | 4.0% |
| J | 1495 | 3.1% |
| C | 1073 | 2.2% |
| R | 1016 | 2.1% |
| M | 975 | 2.0% |
| Other values (16) | 4749 | 9.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 16 | |
| 4 | 16 | |
| 5 | 16 | |
| 3 | 16 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 94622 | |
| , | 14204 | 13.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 33126 | |
| ( | 23 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 33126 | |
| ) | 23 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 14620 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 138 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 327333 | |
| Common | 189946 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 45793 | |
| i | 39194 | |
| s | 32549 | |
| h | 32319 | |
| l | 27685 | |
| g | 25077 | |
| E | 23446 | |
| a | 22397 | |
| e | 14772 | 4.5% |
| r | 10260 | 3.1% |
| Other values (42) | 53841 |
Common
| Value | Count | Frequency (%) |
| ' | 94622 | |
| [ | 33126 | 17.4% |
| ] | 33126 | 17.4% |
| 14620 | 7.7% | |
| , | 14204 | 7.5% |
| - | 138 | 0.1% |
| ( | 23 | < 0.1% |
| ) | 23 | < 0.1% |
| 1 | 16 | < 0.1% |
| 4 | 16 | < 0.1% |
| Other values (2) | 32 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 517279 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 94622 | |
| n | 45793 | 8.9% |
| i | 39194 | 7.6% |
| [ | 33126 | 6.4% |
| ] | 33126 | 6.4% |
| s | 32549 | 6.3% |
| h | 32319 | 6.2% |
| l | 27685 | 5.4% |
| g | 25077 | 4.8% |
| E | 23446 | 4.5% |
| Other values (54) | 130342 |
wins
Categorical
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 262.6 KiB |
| 0 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 33600 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 33600 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 33600 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 33600 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 33600 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 33600 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 33600 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33600 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 33600 |
nominations
Real number (ℝ)
Zeros 
| Distinct | 220 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.8503571 |
| Minimum | 0 |
|---|---|
| Maximum | 433 |
| Zeros | 23453 |
| Zeros (%) | 69.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 262.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 3 |
| 95-th percentile | 23 |
| Maximum | 433 |
| Range | 433 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 17.719188 |
|---|---|
| Coefficient of variation (CV) | 3.6531719 |
| Kurtosis | 136.92517 |
| Mean | 4.8503571 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 9.7027983 |
| Sum | 162972 |
| Variance | 313.96963 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 23453 | |
| 2 | 1305 | 3.9% |
| 3 | 1029 | 3.1% |
| 4 | 867 | 2.6% |
| 5 | 776 | 2.3% |
| 6 | 634 | 1.9% |
| 7 | 544 | 1.6% |
| 8 | 474 | 1.4% |
| 9 | 403 | 1.2% |
| 10 | 340 | 1.0% |
| Other values (210) | 3775 | 11.2% |
| Value | Count | Frequency (%) |
| 0 | 23453 | |
| 2 | 1305 | 3.9% |
| 3 | 1029 | 3.1% |
| 4 | 867 | 2.6% |
| 5 | 776 | 2.3% |
| 6 | 634 | 1.9% |
| 7 | 544 | 1.6% |
| 8 | 474 | 1.4% |
| 9 | 403 | 1.2% |
| 10 | 340 | 1.0% |
| Value | Count | Frequency (%) |
| 433 | 1 | |
| 425 | 1 | |
| 414 | 1 | |
| 394 | 1 | |
| 382 | 1 | |
| 375 | 1 | |
| 369 | 1 | |
| 358 | 1 | |
| 350 | 1 | |
| 337 | 1 |
oscars
Real number (ℝ)
Zeros 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.10261905 |
| Minimum | 0 |
|---|---|
| Maximum | 11 |
| Zeros | 31503 |
| Zeros (%) | 93.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 262.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.50868728 |
|---|---|
| Coefficient of variation (CV) | 4.9570454 |
| Kurtosis | 91.503665 |
| Mean | 0.10261905 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.089619 |
| Sum | 3448 |
| Variance | 0.25876275 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 31503 | |
| 1 | 1422 | 4.2% |
| 2 | 363 | 1.1% |
| 3 | 145 | 0.4% |
| 4 | 74 | 0.2% |
| 5 | 45 | 0.1% |
| 6 | 19 | 0.1% |
| 7 | 17 | 0.1% |
| 8 | 5 | < 0.1% |
| 10 | 4 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 31503 | |
| 1 | 1422 | 4.2% |
| 2 | 363 | 1.1% |
| 3 | 145 | 0.4% |
| 4 | 74 | 0.2% |
| 5 | 45 | 0.1% |
| 6 | 19 | 0.1% |
| 7 | 17 | 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 11 | 2 | < 0.1% |
| 10 | 4 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 5 | < 0.1% |
| 7 | 17 | 0.1% |
| 6 | 19 | 0.1% |
| 5 | 45 | 0.1% |
| 4 | 74 | 0.2% |
| 3 | 145 | 0.4% |
| 2 | 363 |
Interactions
Correlations
| MPA | Rating | Year | budget | grossWorldWide | gross_US_Canada | nominations | opening_weekend_Gross | oscars | |
|---|---|---|---|---|---|---|---|---|---|
| MPA | 1.000 | 0.096 | 0.271 | 0.000 | 0.056 | 0.061 | 0.022 | 0.056 | 0.010 |
| Rating | 0.096 | 1.000 | 0.111 | 0.104 | 0.019 | 0.027 | 0.412 | -0.139 | 0.227 |
| Year | 0.271 | 0.111 | 1.000 | 0.395 | -0.001 | -0.192 | 0.375 | -0.116 | -0.004 |
| budget | 0.000 | 0.104 | 0.395 | 1.000 | 0.706 | 0.644 | 0.329 | 0.690 | 0.121 |
| grossWorldWide | 0.056 | 0.019 | -0.001 | 0.706 | 1.000 | 0.904 | 0.319 | 0.847 | 0.174 |
| gross_US_Canada | 0.061 | 0.027 | -0.192 | 0.644 | 0.904 | 1.000 | 0.242 | 0.924 | 0.207 |
| nominations | 0.022 | 0.412 | 0.375 | 0.329 | 0.319 | 0.242 | 1.000 | 0.121 | 0.323 |
| opening_weekend_Gross | 0.056 | -0.139 | -0.116 | 0.690 | 0.847 | 0.924 | 0.121 | 1.000 | 0.106 |
| oscars | 0.010 | 0.227 | -0.004 | 0.121 | 0.174 | 0.207 | 0.323 | 0.106 | 1.000 |
Missing values
Sample
| id | Title | Movie Link | Year | Duration | MPA | Rating | Votes | budget | grossWorldWide | gross_US_Canada | opening_weekend_Gross | directors | writers | stars | genres | countries_origin | filming_locations | production_companies | Languages | wins | nominations | oscars | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | tt0073195 | Jaws | https://www.imdb.com/title/tt0073195 | 1975 | 2h 4m | PG | 8.1 | 683K | 7000000.0 | 477220580.0 | 266567580.0 | 7061513.0 | ['Steven Spielberg'] | ['Peter Benchley', 'Carl Gottlieb'] | ['Roy Scheider', 'Robert Shaw', 'Richard Dreyfuss'] | ['Monster Horror', 'Sea Adventure', 'Survival', 'Adventure', 'Drama', 'Horror', 'Thriller'] | ['United States'] | ["Water Street, Edgartown, Martha's Vineyard, Massachusetts, USA"] | ['Zanuck/Brown Productions', 'Universal Pictures'] | ['English'] | 0 | 20 | 0 |
| 1 | tt0073629 | The Rocky Horror Picture Show | https://www.imdb.com/title/tt0073629 | 1975 | 1h 40m | R | 7.4 | 173K | 1200000.0 | 115798478.0 | 112892319.0 | NaN | ['Jim Sharman'] | ["Richard O'Brien", 'Jim Sharman'] | ['Tim Curry', 'Susan Sarandon', 'Barry Bostwick'] | ['Dark Comedy', 'Raunchy Comedy', 'Rock Musical', 'Supernatural Horror', 'Comedy', 'Horror', 'Musical'] | ['United Kingdom', 'United States'] | ['Oakley Court, Windsor Road, Oakley Green, Windsor, Berkshire, England, UK'] | ['Twentieth Century Fox', 'Michael White Productions'] | ['English'] | 0 | 4 | 0 |
| 2 | tt0073486 | One Flew Over the Cuckoo's Nest | https://www.imdb.com/title/tt0073486 | 1975 | 2h 13m | R | 8.7 | 1.1M | 3000000.0 | 109115366.0 | 108981275.0 | NaN | ['Milos Forman'] | ['Lawrence Hauben', 'Bo Goldman', 'Ken Kesey'] | ['Jack Nicholson', 'Louise Fletcher', 'Michael Berryman'] | ['Medical Drama', 'Psychological Drama', 'Drama'] | ['United States'] | ['Oregon State Mental Hospital - 2600 Center Street NE, Salem, Oregon, USA'] | ['Fantasy Films', 'N.V. Zvaluw'] | ['English'] | 0 | 15 | 0 |
| 3 | tt0072890 | Dog Day Afternoon | https://www.imdb.com/title/tt0072890 | 1975 | 2h 5m | R | 8.0 | 279K | 1800000.0 | 50002721.0 | 50000000.0 | NaN | ['Sidney Lumet'] | ['Frank Pierson', 'P.F. Kluge', 'Thomas Moore'] | ['Al Pacino', 'John Cazale', 'Penelope Allen'] | ['Heist', 'True Crime', 'Biography', 'Crime', 'Drama', 'Thriller'] | ['United States'] | ['285 Prospect Park West, Brooklyn, New York City, New York, USA'] | ['Warner Bros.', 'Artists Entertainment Complex'] | ['English'] | 0 | 20 | 0 |
| 4 | tt0073692 | Shampoo | https://www.imdb.com/title/tt0073692 | 1975 | 1h 50m | R | 6.4 | 15K | 4000000.0 | 49407734.0 | 49407734.0 | NaN | ['Hal Ashby'] | ['Robert Towne', 'Warren Beatty'] | ['Warren Beatty', 'Julie Christie', 'Goldie Hawn'] | ['Satire', 'Comedy', 'Drama'] | ['United States'] | ['2270 Bowmont Drive, Beverly Hills, California, USA'] | ['Persky-Bright / Vista', 'Columbia Pictures', 'Rubeeker Films'] | ['English'] | 0 | 11 | 0 |
| 5 | tt0072081 | The Return of the Pink Panther | https://www.imdb.com/title/tt0072081 | 1975 | 1h 53m | G | 7.0 | 31K | 5000000.0 | 41833423.0 | 41833347.0 | NaN | ['Blake Edwards'] | ['Frank Waldman', 'Blake Edwards'] | ['Peter Sellers', 'Christopher Plummer', 'Catherine Schell'] | ['Farce', 'Slapstick', 'Comedy', 'Crime', 'Mystery'] | ['United Kingdom', 'United States'] | ['Palace Hotel, Gstaad, Switzerland'] | ['ITC Films', 'Jewel Productions', 'Pimlico Films'] | ['English'] | 0 | 5 | 0 |
| 6 | tt0073026 | Funny Lady | https://www.imdb.com/title/tt0073026 | 1975 | 2h 16m | PG | 6.2 | 6.1K | NaN | 39000000.0 | 39000000.0 | NaN | ['Herbert Ross'] | ['Jay Presson Allen', 'Arnold Schulman'] | ['Barbra Streisand', 'James Caan', 'Omar Sharif'] | ['Biography', 'Comedy', 'Drama', 'Musical', 'Romance'] | ['United States'] | ['Central Station, Oakland, California, USA'] | ['Columbia Pictures', 'Rastar Pictures', 'Vista'] | ['English'] | 0 | 0 | 5 |
| 7 | tt0072653 | The Apple Dumpling Gang | https://www.imdb.com/title/tt0072653 | 1975 | 1h 40m | G | 6.4 | 6.7K | NaN | 36853000.0 | 36853000.0 | NaN | ['Norman Tokar'] | ['Don Tait', 'Jack M. Bickham'] | ['Bill Bixby', 'Susan Clark', 'Don Knotts'] | ['Slapstick', 'Comedy', 'Family', 'Western'] | ['United States'] | ['Bend, Oregon, USA'] | ['Walt Disney Productions'] | ['English'] | 0 | 0 | 0 |
| 8 | tt0073812 | Tommy | https://www.imdb.com/title/tt0073812 | 1975 | 1h 51m | PG | 6.6 | 23K | 5000000.0 | 34279846.0 | 34251525.0 | NaN | ['Ken Russell'] | ['The Who', 'Ken Russell', 'Pete Townshend'] | ['Roger Daltrey', 'Ann-Margret', 'Oliver Reed'] | ['Jukebox Musical', 'Rock Musical', 'Drama', 'Musical'] | ['United Kingdom'] | ['Kings Theatre, 20-24 Albert Road, Southsea, Portsmouth, Hampshire, England, UK'] | ['Robert Stigwood Organisation Ltd.', 'Hemdale'] | ['English'] | 0 | 5 | 2 |
| 9 | tt0073802 | Three Days of the Condor | https://www.imdb.com/title/tt0073802 | 1975 | 1h 57m | R | 7.4 | 65K | 20000000.0 | 27476252.0 | 27476252.0 | NaN | ['Sydney Pollack'] | ['James Grady', 'Lorenzo Semple Jr.', 'David Rayfiel'] | ['Robert Redford', 'Faye Dunaway', 'Cliff Robertson'] | ['Political Thriller', 'Spy', 'Crime', 'Mystery', 'Thriller'] | ['United States'] | ['55 East 77th Street, Manhattan, New York City, New York, USA'] | ['Wildwood Enterprises', 'Dino De Laurentiis Company'] | ['English', 'French'] | 0 | 4 | 1 |
| id | Title | Movie Link | Year | Duration | MPA | Rating | Votes | budget | grossWorldWide | gross_US_Canada | opening_weekend_Gross | directors | writers | stars | genres | countries_origin | filming_locations | production_companies | Languages | wins | nominations | oscars | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 33590 | tt0109809 | FleshEater | https://www.imdb.com/title/tt0109809 | 1988 | 1h 28m | R | 4.9 | 2.1K | 60000.0 | NaN | NaN | NaN | ['S. William Hinzman'] | ['S. William Hinzman', 'Bill Randolph'] | ['S. William Hinzman', 'John Mowod', 'Leslie Ann Wick'] | ['B-Horror', 'Body Horror', 'Horror'] | ['United States'] | ['Beaver Falls, Pennsylvania, USA'] | ['H&G Films Ltd.', 'Hinzman'] | ['English'] | 0 | 0 | 0 |
| 33591 | tt0090582 | The Abomination | https://www.imdb.com/title/tt0090582 | 1988 | 1h 29m | NaN | 4.4 | 770 | NaN | NaN | NaN | NaN | ['Bret McCormick'] | ['Bret McCormick'] | ['Scott Davis', 'Jude Johnson', 'Blue Thompson'] | ['Horror'] | ['United States'] | ['Poolville, Texas, USA'] | ['Donna Michelle Productions'] | ['English'] | 0 | 0 | 0 |
| 33592 | tt0096876 | Bad Blood | https://www.imdb.com/title/tt0096876 | 1988 | 1h 44m | R | 4.6 | 199 | NaN | NaN | NaN | NaN | ['Chuck Vincent'] | ['Craig Horrall'] | ['Georgina Spelvin', 'Randy Spears', 'Linda Blair'] | ['Drama', 'Thriller'] | ['United States'] | ['New York City, New York, USA'] | ['Platinum Pictures (II)'] | ['English'] | 0 | 0 | 0 |
| 33593 | tt0096084 | Shadows in the Storm | https://www.imdb.com/title/tt0096084 | 1988 | 1h 25m | R | 3.9 | 264 | NaN | NaN | NaN | NaN | ['Terrell Tannen'] | ['Terrell Tannen'] | ['Ned Beatty', 'Mia Sara', 'Michael Madsen'] | ['Crime', 'Drama', 'Mystery', 'Romance', 'Thriller'] | ['United States'] | ['Camp Nelson, California, USA'] | NaN | ['English'] | 0 | 0 | 0 |
| 33594 | tt0095177 | Freeway | https://www.imdb.com/title/tt0095177 | 1988 | 1h 31m | R | 5.1 | 588 | NaN | NaN | 142671.0 | NaN | ['Francis Delia'] | ['Deanne Barkley', 'Francis Delia', 'Darrell Fetty'] | ['Darlanne Fluegel', 'James Russo', 'Billy Drago'] | ['Thriller'] | ['United States'] | ['Los Angeles, California, USA'] | ['Gower Street Pictures'] | ['English'] | 0 | 0 | 0 |
| 33595 | tt0094076 | The South | https://www.imdb.com/title/tt0094076 | 1988 | 2h 7m | R | 7.3 | 1.1K | NaN | NaN | NaN | NaN | ['Fernando E. Solanas'] | ['Fernando E. Solanas'] | ['Susú Pecoraro', 'Miguel Ángel Solá', 'Philippe Léotard'] | ['Drama'] | ['Argentina', 'France'] | ['Buenos Aires, Federal District, Argentina'] | ['Canal+', 'Cinesur (Envar El Kadri)', 'Productions Pacific'] | ['Spanish', 'French'] | 0 | 2 | 0 |
| 33596 | tt0256664 | El cabaretero y sus golfas | https://www.imdb.com/title/tt0256664 | 1988 | 1h 25m | NaN | 4.9 | 12 | NaN | NaN | NaN | NaN | ['Raúl Ramírez'] | ['Raúl Marcelo'] | ['Raúl Ramírez', 'Raúl Marcelo', 'Marcela Daviland'] | ['Comedy'] | ['Mexico'] | NaN | NaN | ['Spanish'] | 0 | 0 | 0 |
| 33597 | tt0353261 | BraveStarr: The Legend | https://www.imdb.com/title/tt0353261 | 1988 | 1h 31m | PG | 6.8 | 1.3K | NaN | NaN | NaN | NaN | ['Tom Tataranowicz'] | ['Bob Forward', 'Steve Hayes'] | ['Charlie Adler', 'Susan Blu', 'Pat Fraley'] | ['Superhero', 'Action', 'Adventure', 'Animation', 'Comedy', 'Family', 'Fantasy', 'Sci-Fi', 'Western'] | ['United States'] | NaN | ['Filmation Associates'] | ['English'] | 0 | 0 | 0 |
| 33598 | tt0098474 | Fighting Madam 2 | https://www.imdb.com/title/tt0098474 | 1988 | 1h 30m | NaN | 6.3 | 337 | NaN | NaN | NaN | NaN | ['Teresa Woo'] | ['William Hsu', 'Teresa Woo', 'Larry Dolgin'] | ['Alex Fong', 'Moon Lee', 'Elaine Lui'] | ['Action'] | ['Hong Kong'] | ['Kuala Lumpur, Malaysia'] | ['Molesworth Limited'] | ['Cantonese', 'Mandarin'] | 0 | 0 | 0 |
| 33599 | tt0096039 | Saturday the 14th Strikes Back | https://www.imdb.com/title/tt0096039 | 1988 | 1h 18m | PG | 3.1 | 737 | NaN | NaN | NaN | NaN | ['Howard R. Cohen'] | ['Howard R. Cohen'] | ['Ray Walston', 'Avery Schreiber', 'Patty McCormack'] | ['Comedy', 'Fantasy', 'Horror', 'Sci-Fi'] | ['United States'] | ['Venice, California, USA'] | ['Pacific Trust'] | ['English'] | 0 | 0 | 0 |